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1.  Introduction 


Suppose  in  a  laboratory,  say  Laboratory  I,  a  certain  instrument  is  designed  to  mea¬ 
sure  several  characteristics,  and  a  number  of  vector- valued  measurements  is  recorded. 
Qur  objective  is  to  estimate  the  unknown  population  mean.  It  is  known,  however, 
that  a  similar  instrument  is  used  in  another  laboratory,  say  Laboratory  II  for  the 
same  purpose,  and  a  number  of  observations  is  recorded  from  the  second  instrument. 
It  is  also  suspected  that  the  two  population  means  are  equal,  in  which  case,  observa¬ 
tions  recorded  in  Laboratory  II  can  possibly  be  used  effectively  together  with  those 
in  Laboratory  I  for  estimating  the  population  mean  of  the  first  instrument.  Thus, 
the  question  that  naturally  arises  is  whether  one  should  use  the  sample  mean  from 
Laboratory  I  or  the  pooled  mean  from  the  two  laboratories. 

In  problems  of  this  type  what  is  normally  sought  is  a  compromise  estimator  which 
leans  more  towards  the  pooled  sample  mean  when  the  null  hypothesis  of  the  equality  of 
the  two  population  means  is  accepted,  and  towards  the  sample  mean  from  Laboratory 
I  when  such  a  hypothesis  is  rejected. 

A  very  popular  way  to  achieve  this  compromise  is  to  use  a  preliminary  test  esti¬ 
mator  (PTE)  which  uses  the  pooled  mean  when  the  null  hypothesis  is  accepted  at  a 
desired  level  of  significance,  and  uses  the  sample  mean  from  Laboratory  I  when  oppo¬ 
site  is  thecae.  For  an  excellent  review  of  PTE’s,  see  Bancroft  and  Ilan  (1981).  It  is 
known,  though,  in  other  situations  that  a  PTE  is  typically  not  a  minimax  estimator, 
and  estimators  with  uniformly  smaller  mean  squared  error  (MSE)  than  the  PTE  can 
often  be  produced  (see  for  example  Sclove  et  al  (1972)).  Moreover,  the  degree  of 
evidence  for  or  against  the  null  hypothesis  is  not  reflected  in  the  PTE. 

In  this  paper,  we  propose  instead  an  empirical  Bayes^ER)  estimator  which  achieves 
the  intended  compromise.  Such  an  EB  estimator  is  quite  often  a  weighted  average  of 
the  pooled  mean  and  the  first  sample  mean.  The  weights  are  adaptively  determined 
from  the  data  in  such  a  way  that  larger  the  value  of  the  usual  F  statistic  used  for 
testing  the  equality  of  the  two  populatin  means,  the  smaller  is  the  weight  attached 
towards  the  pooled  sample  mean.  Thus,  unlike  the  PTE,  the  EB  estimator  incorpo¬ 
rates  the  degree  of  evidence  for  or  against  the  null  hypothesis  in  a  very  natural  way. 
Also,  unlike  a  subjective  Bayes  estimator,  the  EB  estimator  is  quite  robust  (with 
respect  to  its  frequent.ist  or  Bayesian  risk)  against  a  wide  class  of  priors. 

Section  2  motivates  the  EB  estimator,  and  its  Bayesian  properties  are  discussed 
in  this  Section.  Among  other  things,  it  is  shown  that  the  EB  estimator  has  uni¬ 
formly  smaller  Bayes  risk  than  the  first  sample  mean.  In  Section  2,  the  estima¬ 
tors  are  compared  in  terms  of  their  frequentist  risks,  and  sufficient  conditins  un¬ 
der  which  an  EB  estimator  dominates  the  first  sample  mean  are  given.  Also,  in 
this  section,  a  modified  EB  estimator  is  proposed,  and  sufficient  conditions  un¬ 
der  which  it  dominates  the  PTE  are  given.  Finally,  in  Section  5,  a  hierarchical 
Bayes  approach  is  proposed  as  an  alternative  to  EB  estimators.  It  has  recently 


K 


come  to  our  attention  that  Saleh  and  Ahmed  (1987)  have  considered  estimation 
of  // 1  under  the  loss  =  (6  -  /t j )' V '_l(<5  —  //.f),  assuming  Vt  =  V2  =  V 

unknown,  and  proposed  the  shrinkage  estimator  xt  4-  — | ^-(x2  -  Xt)  •  where 
7’J  —  (x2  _  x1)'5_1(x2  -  x | ) ,  n5  =  pooled  sum  of  squares  and  products  matrix, 

n  =  »>  ,  4  n.2  —  2,  and  0  <  c  <  t>  ^  A  comparison  of  the  risk  of  the  above 

estimator  with  those  of  the  PTE  as  well  as  x.  and  is  also  undertaken  bv 

the  above  authors. 


2.  The  EB  Estimator  and  its  Bayesian  Properties 


Lot  Xii(t  =  l,...,n<)  and  X2l(t  =  l,...,n2)  be  independent  p(>  3)-dimensional 
random  vectors,  where  X|,’s  are  iid  iVr(/tt,a2Vi),  while  X2,’s  are  iid  /Vp(p2, cr2V2). 
In  the  above  // 1  £  Rr ,  p2  €  Rr  and  o2(>  0)  are  unknown,  but  V(  and  V2  are  known 
p  x  p  p.d.  matrices.  Our  goal  is  to  estimate  /t|. 

In  order  to  motivate  the  EB  estimator,  we  need  find  first,  a  Bayes  procedure.  It  is 
immediate  that  the  minimal  sufficient  statistic  for  (/tj,//2,o2)  is  (X,,X2,  tr(V,  1 S i  I 
V2  ’S2)),  where  X,  =  n/’  X„(.7  =  1,2)  and  Sy  =  ^.(X,, -X,)(Xy, -X;)r, y  - 

1,2.  Note  also  that  X,  ~  Np(pj,  o7n~  lVy)  (j  —  1,2),  while  tr(Vf'Si  +  V7  1 52)  ~ 
2  2 

**  X {n i  (  nj  -!)/>• 

In  a  Bayesian  frame  work,  the  above  is  treated  as  a  conditinal  distribution  given 
p |  and  p2.  We  use  the  independent  Nr(i>,  r2nj'1  V|)  and  /Vp(/^,  r2n2 1 V2)  priors  for  /r. t 
and  p2,  that  is  the  prior  variance  -  covariance  matrix  is  proportional  to  the  variance- 
covariance  matrix  of  the  corresponding  sample  mean.  The  suspicion  that  p(  and  p2 
may  be  equal  is  reflected  in  the  choice  of  a  priori  common  mean  u.  For  a  related  prior 
in  the  general  regression  model,  see  (lliosh  et  a.l  (1987). 


In  order  to  find  the  posterior  distribution  of  //.  =  y  1  j  ,  first  note  that  conditional 

on  pi  and  p2,  Xi,X2,  St  and  S2  are  mutually  independent,  and  the  distributions  of 
S(,S2  do  not  depend  on  pi  and  p2.  Hence,  we  can  restrict  ourselves  to  the  conditional 
distributions  of  X/s  given  py’s.  Also,  since  //(  and  p2  have  independent  normal 
priors,  standard  calculations  yield  that  p\  and  p2  given  Xt  and  X2  have  independent 
posterior  distributions  with 


P>|Xy  =  X,  ~  N„{{ l  -  Z7)J c;  -I-  tft/,c72(l  -  flJn-'Vy)  (2.1) 

j  —  1,2  where  [J  =  cr2/(o2  4-  r2).  Now,  using  the  loss 

L(p,,a)  =a*l(a-p1)Tg(a  //,)  (2.2) 

for  estimating  pi  by  a  (Q  being  a  known  p.d.  weight  matrix),  the  Bayes  estimator 
of  //  f  is 


en(X,)  -  (1  -  fl)X,  1  Uu.  (2.3) 

Note  that  the  Bayes  estimator  does  not  depend  on  the  choice  of  Q.  The  multiplier 
a  2  is  used  in  the  loss  because  that  makes  Xt  a  minimax  estimator  of  p,  with  constant 
risk  not  depending  on  any  unknown  parameter. 

In  order  to  find  an  EB  estimator  of  // 1 ,  we  estimate  the  unknown  parameters  H 
and  i'  in  (2.3)  from  the  marginal  distributions  of  X(,  X2  and  tr(V,  'S,  |  V2  'S2).  Note 
that  marginally  X|,  X2  and  tr(V,  * S ,  )  V2  'S2)  are  mutually  independent  with  X,  - 
/VP(i/,n;  '{a2  t  r2)Vj),(;  =  1,2)  and  tr(V,  'S,  I  V2%)  -  o2\(2„^ni  2)p.  Hence  the 


complete  sufficient  statistic  for  (z/,  r2,c 72)  based  on  this  marginal  distribution  is  (W,  Z, 
trfVr'S,  +  V2  *S2))  where  W  =  (n,Vf'  4-  nJV2-,)-,{n, Vf'X,  +  n2V2 ’X2)  is  the 
pooled  sample  mean,  Z  =  Yr(n|_1V,l-(-n2"'lV2)'1Y  and  Y  =  X]  -X2.  Also,  marginally, 

,2  i  ,21 


W  ~  NP{u,(a2  f  r2)(n, Vf1  +  n2V2')),Y  ~  W„(0,  K’V,  4  »2'V2)(eJ  f  r2))  and 


tr(V-'S,  4  V2lS2)  ~  o2x[ni+n?_2)r.  Hence,  the  UMVUE  of  u  is  W,  while  the 
UMVUE  of  (a2  I-  r2)-'  is  (p  -  2) / (YT (n t~ 1 V,  4  n2‘V2)  ’Y).  The  last  assertion 
follows  since  Yr(»,',V1  4-  2i2_,V2)“'Y  ~  (a2  f  r2)y2.  Moreover  since  tr(Vj~'S|  f 
V2'S2)  ~  <72Xn,+n3  2)P'  the  best  scale  invariant  estimator  of  o2  is  ((n,|  4n2-2)p  42) _l 


tr(V,  1 S t  4  V2lS2).  Substituting  these  estimators  for  u,  {o2  4  r2)  1  and  a2  in  (2.3), 
one  gets  the  EI3  estimator  of  /i(  as 


flBB(XltX2,S,,S2)  -  (1  -  B)Xx  4flW  =  W  4  (I  -  B)(X,  -  W) 


(2.4) 


where 


B  = 


(p  ~  2)tr(V,~,SI  4-  V2  *S2) 


((ni  4-  n2  -  2)p  +  2)Y T(n;'V{  f  n2-'K2)-»Y 


(2.5) 


Remark  2.1  Note  that  0  <  B  <  1,  while  the  estimator  B  though  positive  can 
values  exceeding  one.  Accordingly,  for  practical  purposes,  one  proposes  the  positive 
part  KB  estimator 


pJUX„X2,S„S2)  -  W  4-  (I  -  /?)'(X,  -  W).  (2.6) 

of  p. |,  where  a K  -  max(<?,0).  For  simplicity  of  exposition,  in  the  remainder  of  this 
section,  we  shall,  however,  work  with  e^B  rather  than 

A  question  that  naturally  arises  is  why  this  particular  method  of  estimation  is 
used  for  estimating  the  prior  parameters.  We  shall  answer  the  question  by  proving 
the  “optimality”  of  eKn  within  the  class  of  estimators 


A.(X ! ,  X2,  S, ,  S2)  -  W  4(1 


Ctrl  'S,  l  V2'S2) 


((".  4  n2  -  2)p  -  2)YT  (n(  1  V2  4  n2'V2)  <Y 


)(X,  -W). 


(2.7) 


where  c(.>  0)  is  a  constant.  Note  that  ePP  =  f>r_  2. 

Theorem  2.1  The  Bayes  risk,  of  f>r  under  the  assumed  prior  (say  f)  and  the  loss 
(2.2)  is  given  by 


r{LF)  ~  ( I  B)nt  'tr(yV'i)  I  Htr(^(n,  V,  1  |  rr2 1 '2 


i  \  n 


t  /?tr(gA(n,,V,  I  ti2  1 V2)  AT  ) 
c2  ( n  |  I  n  2  2) 


2 r(n,  f  n 2  -  2) 


{ (n.|  f  n2  -  2 )p  \  2 }(p  -  2)  (n|  I  n7  -  2 )p  I  2 


4  1 


(2.8) 
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where,  A  is  defined  following  (2.11)  below. 


Proof : 

The  second  part  of  the  theorem  follows  immediately  from  (2.8).  To  prove  the  first 
part,  write 

r(£,6c)  =  r((,eg)  +  o~2  E\(en  -  Sr)rQ(er,  -  ^))  (2.9) 

Note  from  (2.1)  to  (2.3)  that 

*'U,eB)  =  (l-BK,tr(QVl)  (2.10) 

Also,  writing  lie  -  ctr(Vf ‘S,  +  V2'S2)/{n,  f  n2  -  2)/>  +  2)YT(nj-,V,  -h  n2 'V2)_l  Y}, 
one  gets 

efl  -  6<  =  (1  -  D)Xi  +  Du-  W  -  (1  -  ^)(X,  -  W) 

=  -fl(W -!/)  +  (&-  D)(X,  -  W) 

=  -B(W  -  u)  +  (B,  -  B)\Y,  (2.11) 

A  =  (n.V,1  +n2V-,)-,naV,'. 

Next  using  the  independence  of  W  and  (Y,  trCV^St  +  V2  *S2))  and  the  facts  that 
E(W)  =  v,  Var(W)  =  (a2  +  r2)(n, V,-' +  n2V2 ')- 1  =  o2  B~ 1  (n,' Vf 1  +  n, V2  1 )‘ ' ,  one 
gets 

/V((eB  -  6,)rQ(ep  -  Se) \ 

=  B2E\{ W  -  v)rQ( W  -  i/)|  !-  E[(0e  -  Z?)2YrArQAY| 

-  fT2BU{Q{n{Vx-' -\-n2V2~')-'}  +  E\(Br-  B)2Yt\tQ\Y]  (2.12) 

Now  we  find 


E\{Br  -  f?)2 Yr  A7  QAYj 

«,  «*{tr(^.  %  '  ^2  %)}2 

'>{(n,  1  n2  -  2)p  I  2HVrK,*/i  +  ”2^2)  1 Y}2 
2 Be  tr(V,  ’.S,  1- IV’-S)  (y7 

{(".  1  n2  2)r  I  2}  { Yr (n,  1  V\  +  n2'U2)'Y}1 
1  f?2(YrATgAY)| 


(Y7  ArgA  Y) 


(YrArgAY) 


(2.13) 


Using  the  independence  of  Y  and  tr(V1  1 S 1  +  V2'S2)  along  with  the  fact  that 
tr(V,  'S|  1  V2  'S2)  ~  o2x2n|fnj  2^r,  it  follows  that  the  right  hand  side  of  (2.13) 


-  E\ 


c2o ,'(n1  4  n2  -  2 )p 


YtAtQ\Y 


1  (n,  +  n2  -  2)p  +  2  {Y^nf’E,  4-  n,"1  V,)-' Y}2 
2Bm2(n,  4-  n,  -  2)p  YTArQAY 


(n,  +  n2  -  2)p  +  2  {YT(n,-‘l/1  +  n^'VjJ-'Y} 


+  Z?2(YrArQAY)| 


(2-14) 

Next,  observe  that  Yr(ni"1V,  |-  rij'V'^’Y  is  a  function  of  the  complete  sufficient 
statistic  while  (YT  AT  Q A  Y)/(Yr(nj'1  Vt  4  ^"'^(’'Y)  is  ancillary.  Now  using  Dasu’s 
theorem  (or  Lemma  1  of  Ghosh  et  al  (1987))  along  with  F,(YT ATQAY)  =  (a2  +  r2) 
tr(gA(n.|-,K1  +  n2-,K2)Ar),£;(Y7'(n)'1Kl+n2-|V2)-,Y)  -  p(o2  +  r2),  and  E[YT[n^V{  + 
2 1  V'2 )  1 Y )  1  —  (a2  •(-  r2)~'(p  -  2)~',  it  follows  that  the  right  hand  side  of  (2.14) 


c2o2B(nl  4  n2  -  2)/>tr(<?A(n1  1  V,  I  n2'V2)Ar) 
{(n,  4-  n2  -  2)p  4  2}p(p  -  2) 

2 co2 B{nx  4  n2  -  2)p\.r(Q  A(n~{ 1 Vx  \  n 2 1 F2 ) A r ) 
{(n,  f-  n2  -  2)p  4  2}p 
4-  a2  BU(QA(i>  j" 1  Vx  4  n2'K2)Ar) 

It  follows  from  (2.12)  -  (2.15)  that 


(2.15) 


/V[(ep  fir)TQ(on  A^)| 

-  *2Bl.r(CJ(»,V,  '  4»2E2  ')•')  t  a2BU{QA{v,  '  F,  |  n.2'V'2)Ar) 
c2(n,  f  u2  2)  2c(n,  |-  n2  -  2) 

{ (r» |  I  n2  -  2)p  I  2}(p  -  2)  (n,  4  n2  -  2)p  4  2 


(2.10) 


The  proof  of  the  theorem  is  complete  from  (2.9),  (2.10)  and  (2.10). 

Next  we  compare  the  Bayes  risks  of  pEp  and  X|.  Note  that  X|  has  constant 
risk,  ami  hence  constant  Bayes  risk  (under  any  prior)  o2n,  ,tr(gi/i).  Rather  than 
comparing  the  Bayes  risks  of  pEp  and  X|  directly,  we  find  it  convenient  to  introduce 
the  notion  of  Relative  Savings  Loss  (RSL)  as  in  Efron  and  Morris  (1975). 

lror  any  estimator  o  of  //.],  the  RSL  of  prn  with  respect  to  p  (under  the  prior  £) 
is  defined  as 

RSL{$;pep,p)  =  lr((,cEn)  -  r((,pn)l/lr((,p)  r(£,e„)J 

-  1-  |r(£,e)  - ■  r(£,0|rn)|/|r(f,p)  -  r(£,eR)|  (2.17) 

This  is  the  proportion  of  the  possible  Bayes  risk  improvement  over  p  that  is  sacrificed 
by  the  use  of  pep  rather  than  the  ideal  Pp  under  the  prior  £.  From  (2.8)  with  r  -  j>  2 
and  (2.10),  it  follows  that 


ygiji jjjm JJVJ'A'JL  'A'XMV/J  V.  ^WV’J  ,>"J.',X,>  ">.V  ">.’>  '>  V  ->  -.»  -j-  v  ->  v  v 


nShtf\eF.n*Xx) 

-  Mg^.v,'  i  »2 v 2 1 )  1 ) 

C  U(«?*(n,  'l',  ,,,  V,)Ar)  (j,^'„7”-V),r!:  j)11”'  'trW'')l"'  <2  '"> 

Note  I, ha!,  I, hr  above  RSI,  expression  docs  not,  depend  on  any  unknown  parameter. 
Also,  from  (2.18),  since  2(n.|  I-  n2  -  I)  <  (ut  f  n2  -  2 )/>  I-  2,  arid 

(».V,  '  )■  njVj  I  A ( n ,  1  F,  f  »2'l2)A7'  =  »\'VU  (2.19) 

it.  follows  I. hat,  RSI,(£;  oF:p ,  Xt )  <  I  which  is  equivalent,  t.o  r((,pK„)  <  r(£,X|). 

'I'hns  oEP  has  smaller  Bayes  risk  than  Xt.  A  proof  of  (2.19),  follows  easily  from 

simultaneous  diagonalization  of  V{  and  F2.  Alternatively,  writing 


A  -  (n.,1'',-'  I  ujVj-1)  'njVj'1  —  ( V'2  '(»|V2  I  »»2l/i}l/r'r  * ”2 ^2 

-  n^Vtfc'V,  I  »7'V2)  \ 

it  folows  that 


(»,!',  ’  I  »i)\  ')  ’  I  A ( n 1  \\  I  n,,V',)Ar  -  n7' \V7  I  n.,  1  At7, 

-  A(h,  'V,  I  v7'V2)  -  ni'V, 

which  is  (2. 19). 

Finally,  in  this  section,  we  compare  the  Bayes  risk  of  r>FP  with  that  oT  W.  Note 
that  11’  has  Bayes  risk 

r(£,W)  -  r(*,r„)  I  o  1  E\(rn  W )rQ(rn  W)|  (2.20) 

Since  e„  W  -  (I  B)X,  I  Uv  W  -  //(W  „)  |  (I  /?)(X,  W)  - 

/f(W  i/)  I  (I  /?)  AY  where  A  is  defined  following  (2.11),  rising  once  again  the 
independence  of  W  and  Y,  it.  follows  that 


/■;|(r„  W)Tg(r„  W)) 

olmr(g(n,  V,  '  I  n2V,  ')  ')  I  (I  B)7/-;(Y7  A'  gAY) 

-  o7mr(g(»,v, '  I  njVj')-')  (  o7(i  n)2n  'u(q\{h{  'f,  t  »2'f2)at  (2.21  > 

rims  from  (2.10),  (2.20)  and  (2.21), 


r(^W)  -  n,'(l  -  /?)tr(g^)  f  /ftr^^.V,  '  I  n2V2')  ') 

+  (l  -  H)2J9-'tr(gA(n7'l/,  I  n2 1  V’2) Ar) 


(2.22) 


Finally,  from  (2.8)  with  c  =  p-  2,  (2.10)  and  (2.22),  it.  follows  that,  RSL(£;  eKfli  W)  — 
K(.pFn)  -  r(«ti Pn)|/|r(x.  W)  -  r(£,r„)j 


tr(Q(„,V,'  i  n,V,  V)  K^V,)^) 

lr(Q(ni V,  »  4-  ,?2V2  ')->)';  (1  -  5)*D:nr(QA(»rrKr  4-  n2  ’V'V)Ar) 

which  is  loss  than  one  if  and  only  if 

{(1  -  ft)/ ft}2  >  2(n,  h  n2  -  l)/{(n,  4-  n2  -  2)p  4-  2} 


(2.23) 


(2.24) 


Remark  2.2  The  fact  that.  does  not  dominate  W  uniformly  is  not  at.  oil  sur¬ 
prising.  If.  for  example,  t 2  is  very  small  and  p.\  is  nearly  degenerate,  at  i /,  then  W 
is  much  closer  to  v  than  Opp.  Indeed,  in  this  ease  H  =  o2/(o2  -|-  r2)  is  very  close  to 
I  so  that  (2.2\)  cannot  hold.  However,  when  o2  <  r2.  then  13  <  !*-+([  -  ft)/ft  >  I 
so  that  (2.2\)  holds. 


A*  A*  A*  A  ■  ‘  ^  A' 


A*  S-’  V  V 


/  v.y.v 
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3.  Minimax  Estimation 

It  is  well  known  that  under  the  loss  given  in  (2.2),  X|  is  a  minimax  estimator  of  pt 
with  constant  risk  nj  1  tr(QV,).  In  this  section,  first  we  find  a  class  of  estimators 
including  eEB  as  a  member  which  dominates  Xi  under  certain  conditions,  and  then 
investigate  whether  e^p  satisfies  these  conditions. 

With  this  end,  first  write 


F  =  (Yr(uj‘ll/i  -i-  n2  lV2)~'  Y)/{tr(^1-,5,  +  %)/((”,  +  n2  -  2 )p  +  2)}  (3.1) 

and  consider  the  class  of  estimators 


/if  =  X,  -MF)/F)(X,  -  W)  (3.2) 

for  estimating  pi.  Note  that  epp  belongs  to  this  class  with  <t>(F)  =  p  -  2.  We  now 
compute  the  frequentist  risk  of  the  estimator  f't  (i.e.  without  any  reference  to  the 
prior  £).  Throughout  this  section,  E  denotes  expectation  conditional  on  pt  and  p2. 


Theorem  3.1 


m\{nt  -  -  **.) 

=  n-‘tr(QV,)  -  2E 

i+*(n 


/O2 

+{F) 


tr(ArQAV)  +2 (<t>'(F)  -  ^) 


<f>(F).  YrATQAY 


YTV-'Y 


+  <f2E\'-jrL  Y7'ArQAY| 
where  V  =  n\x V (  -f  n2  1 V2. 


(3.3) 


Froof :  First  write 

£|(m?-m,)tQ(m?-m.)] 

-  F((X,  -//.,)rQ(X,  -//,) 

-  2(<*(F)/F)YrArQ(X,  -  ;it) 

I-  (<f>2(F)/ F2)YT  AtQAY]  (3.1) 

where  we  have  used  the  fact  that  X|  —  W  =  AY.  Next  writing  X|  =  W  +  AY  and 
correspondingly  pi  =  p.  +  A/tn  where  p.  —  (n f V,- 1  f  n2V2  ')  1  (n|Vr,  ’p i  +  n2V7  '//2) 
and  pn  =  Pi  —  p2,  one  gets 


F!(<*(F)/F)YrArQ(X,  -  p,)| 

=  F|MF)/F)YTArQ((W  -  P-  f  A(Y  -  p0))| 
-  E\{4>(F) / F)YT \T Q K{Y  -  po)] 


(3.5) 


where  in  the  final  step  of  (3.5),  one  uses  the  independence  of  (Y,tr(V/,  '.5)  f  V7  *52)) 
with  W  as  well  as  F( W)  —  /i..  Now  since  V  is  p.d.,  there  exists  a  nonsingular  D  such 
tilt,  D  ' V(D  ')r  -  Ip.  Write  Z  —  D  *Y  and  r,0  -  D  Then  Z  ~  /Vp(*/n,o2 Ip). 
We  rewrite 

Yr  ArQA(Y  -  /in)  -  ZrU(Z  -  r/n),  (3.6) 

where  l.J  =  ((»,;))  —  DrArQAD.  Also,  in  terms  of  Z,  F  =  Zr Z/(tr(l/|“l5,  + 
V2  ,Sj)/((nl  I  n2  2 )p  t-  2)}.  Now  using  Stein’s  identity  (cf.  Stein  (1981)),  the 
independence  of  Z  and  tr(Vy 'F,  (-  V2  '.S2),  and  (3.6), 


F|(<A(F)/F)ZrU(Z  -  r/n)| 

2  F  0  j*[F)  f 

h  ’  ’ 


(/)  W) _ 

F  r2  I  ftr[vv'.s  I  V 2  '.S)/((n,  f  n.2  2 )p  +  2*} 


-  m.iM.tim  .*12).. _ ztz  _  , 

f  I  I'  r’  |  jir(c,  '5,  I  r,  's,|/((n,  +  -  t)r  i  2)} 

-  f^’WA-QAV)  ^  1  . 

F  v  *  '  If  F2  I  YrV  1 Y 


crJF  ^tr(ArQAY)  I  2  j  <f>'(F) 


F  F2  j  Yr  V  «Y 

((n  jtni  yt\tq\y' 

[  1  F|  Y7'V  1 Y 


The  theorem  follows  now  from  (3.3),  (3.4)  and  (3.7). 

Next  in  this  section  we  find  an  upper  hound  for  F|(^2( F)/ F2 ) Yr A 7  Q A  Y |.  We 
first  get  the  inequality 

F|(^(F)/F2)(YrArQAY)| 

*2(F)  YT\TQ\Y  t. r ( V-j  'F,  I  l2  'f2)  , 

''  jf-i  '  '  V7  V  '  Y  '  {(»»,  I  i*2  2);»  l  2}  1 

-  ch  \  {  \  T  Q  \  V  )  fj\h2  {  F)  F  tr(V,  ’.S'.  I  V2  1  ,S2  )/((*,,  l  „2  2)/»  l  2)| 


where  chi  (A7  Q  A  V)  denotes  the  largest  eigen  value  of  A 1  Q  A  V  and  h(  F)  -  f/>(  F) ,  F 
Next,  applying  (2.18)  of  Kfron  and  Morris  (1976),  one  gels 


/;|/»2(F)Ftr(V'|  *F,  |  Vj  ' Fj )/((** !  *  *»2  2)/»  *  2)| 


g|  i^+ .  h>{F)F  +  - ?_ - tr(^S,  ±  VV% 

(n.j  4  r>2  —  2)p  4  2  (n.i  4  r?-2  -  2 )p  +  2  (uj  4  rt2  —  2)/)  4  2 

(2/t(r)/t'(f)F  +/.1(F))(~-  ■■!  -  ,  -  -r-C - XT— t)| 

tr ( k |  02  +  k2  *52)/((ni  +  n2  —  2 )p  2) 

^  E  [  (n»  ±  ~  2)P  .  <^(F) 


[(nj  +  n2  —  2)p  4-  2  F 

_ t _ J2(£ 

(n,  4  n2  -  2)p  4  2  [  \ 

f*’(F)  4 


{2(£m.^)w  +  ^)}] 


=  l^T1  ~  fr,+n,-  2),  +  2^'<£>j  ,3'9» 

From  (3.8)  and  (3.9),  one  gets 


(YTATgAY) 


<  f7,chi(AT(?AV')i? 


W) 


(rij  4  n2  —  2)p  4  2 


<£(W(F)  (3.10) 


Combining  (3.3)  and  (3.10),  one  gets 


°-2E\{n*  -  //,)rQ(/rf  -  mi)  -  (X,  -  m,)tQ(X,  -  //,)) 
<  [^A'QAV,  +  2(*'(F)  -  £0-) 

i  . i.  t  l  T l -it \  tt* 


+  'MA'QAV)F  -_+.^(F)^(F)j  (3.U) 

The  following  theorem  is  now  easy  to  prove  from  (3.11).  Recall  that  A  =  (niV,-1  4 
n2l2  ')  'n^-1  and  V  —  nJ"*K|  4  n2 ‘V^. 


Theorem  3.2  Suppose  that 

( t )  t.r(A7QAF)  >  2ch|(ArQAl/) 

(") 0 "  *(H  <  -  2I  and 

(itt)  4>(F)  t  in  F 

hold.  Then  o7E\(p*  -  p\ )T(J(//.f  //,)  -  (X,  -  p,)TQ(X,  -  pt)j  <  0  for  all  p ,  and 


O  **  O.  c  CToWa*  e  A*  V. O. OO  a  v"  i*  •  aH.*1  \  . 


<■.  -*  S' 

Lvl  JU 


« 

4 
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Proof  :  Using  (iii),  it  follows  from  (3.11)  that 


o~2E\(fi*  -  p .i)TQ{,4  -  Mi)  -  (X,  -  /j,)TQ(X,  -  //.,)! 

<  2,  [-^>tr(A’'QAV,  +  2*&  ■  +  ^—jflchi(ATQAV) 

<  2 E  -^itr(ATgAU)  +  2^ch1(ArgAU)  +  ^^ch1(/rgAU) 

-  »[-«?*  -.)-«)] 


(3-12) 


using  conditions  (i)  and  (ii)  of  the  theorem. 


Remark  3.X  It  is  an  immediate  consequence  of  the  above  theorem  that  if  condition 
(i)  of  Theorem  3  holds,  and  0  <  p  -  2  <  2[(tr(ArQAl/)/ch|(Ar<^AI/)  -  2),  then 
the  ER  estimators  cEP  dominates  Xp  In  particular,  if  Q  —  V(  =  V2  =  Ir,  then 
\.r(\T Q\V)  —  p  ch)(ArQAVr),  and  hence  aFp  dominates  X|  for  p  >  3. 

In  the  remainder  of  this  section  we  show  how  a  modified  Ell  estimator  can  domi¬ 
nate  the  PTE.  Once  again,  an  appeal  to  Theorem  3.1  is  made. 

A  PTE  6pte  of  Mi  is  °f  the  form  f>PTFi  =  g(F)~X. |  +  (1  -  <7(0) W  =  X(  -  (I  — 
<7(/'’))(X|  -  W)  where  g{F)  =  for  some  positive  constant  d,  and  /  denotes  the 

usual  indicator  function.  The  choice  of  d  is  governed  by  the  level  of  significance  that 
is  used  for  testing  ff0  :  M\  —  Mi-  We  propose  the  rival  estimator 

f>MEP  =  X,  -  (1  (I  -  ^)<7(F))(X,  W) 

=  W  +  (l-£)g(F)(X,  -W)  (3.13) 

which  is  a  modified  version  of  eFp  with  p  —  2  replaced  by  a  general  c.  Note  that 
—  W  when  g(F )  =  0,  but  b\iFp  =  fiFp  when  g(F)  ~  1.  The  following  theorem 
is  then  proved. 

Theorem  3.3  Suppose  condition  (i)  of  Theorem  J?  holds  and  0  ^  r  2| 

2|.  Then 

o  7  E\(6mf.b  ~  Mi)TQ(b\irzn  ~  /'i)  -  (•We  -  M\)T Q(<We  -  /^i)|  <  0  (3.14) 

for  all  Mi  ott-d  Mi- 


Write  MF)  =  F{  l  -  g(F))  and  <f>2(F)  =  F(1  -  (1  -  ^)g(F))  =  <f>t{F)  f  cg(F). 
Then  hrTK  =  X,  -  (*i(F)/F)(X,  -  W)  while  /Wa  =  X,  -  (^(F)/F)(X,  -  W). 
Note  that  both  <j>i(F)  and  <f>i{F)  are  differentiable  everywhere  except  at  F  =  d.  Thus 
4>\{F)  and  </>2(F)  are  defined  a.e.  (Lebesgue).  Moreover,  4>\{F)  -  <i>2[F)  =  -eg (F), 
4>]{F)  -  <t>\{F)  =  -c2g2(F)  =  -c2g(F)  and  4>\{F)  =  <f>\{F)  =  1  -ff(F)  a.e.  Lebesgue. 
Then,  applying  Theorem  3.1  twice  once  with  (f>{F)  —  <t>2[F),  and  next  with  <i>{F)  — 
4>\(F),  one  gets 

Left  hand  side  of  (3.14) 


+ 

< 

4- 

< 


-2E 
o~2E 
-2E 
a  2E 
-2E 


^(n(/Armln  2  Y7AtQAY 

F  tr<A  QW)~  fMF)  yrv/_ly 

cV(F) 


(YrATQAY) 


F2 


clL?hr(\TQ\V)  -  lcg(F)c\U(\TQ\V) 
v  r 

cV(F)  tr(iy I-  T2-’52)  YrArQAY 

F2  ‘  '  (n,  4-  n2  -~2)/>  +  2  '  YrV”'Y 

C<r(F)tr(ATQA^)-^.chl(ATQAK) 


F 

ff"2ch,(ArQAV’)F 


F 

«|V(r_)  F 

F2 


tr(ArQAT) 


(r?-!  4  n2  -  2)p  f  2 


(3.15) 


Applying  (2.18)  of  Efron  and  Morris  (1976)  again  with  <£(F)  =  g(F)  so  that  <£'(F)  -  0 
a.e.  Lebesgue,  one  gels 


E 


92{F) 


F2 


Ftr(l/,  1 5,  h  VV'S2)/((n,  I- n,  -  2)p  +  2) 


a2F[(72(F)/F|  -rTJF|(7(F)/F| 


Now  from  (3.15)  and  (3.16),  left  hand  side  of  (3.14) 


(3.16) 


p; 


<•9  (F) 


F 


rh,(ArgAL) 


L  /  *r(ArQAr) 
l  \  r  h !  ( A  7  <7  A  l ' 


(3.17) 


by  using  the  upper  bound  of  r  given  in  this  theorem.  The  proof  of  the  theorem  is 
complete. 


Remark  3.2  Note  that  when  Q  =  V\  —  Vj  =  Ip,  the  conditions  of  the  theorem  hold 
when  0  <  c  <  2 (p  —  2),  and  in  particular  when  c  —  p  —  2,  p  >  3. 


•■.v.v.V.VV 


,n,'  ,v  .% 


4.  Hierarchical  Bayes  Estimation 

Section  2  is  devoted  to  classical  empirical  Hayes  estimation,  i.e.  when  the  unknown 
prior  parameters  are  estimated  by  classical  methods  of  estimatin  such  as  uniformly 
minimum  variance  unbiased  estimation,  maximum  likelihood  estimation,  best  invari¬ 
ant  estimation  etc.  Instead,  one  can  assign  prior  distributions  (proper  or  improper) 
to  the  hyperparameters,  and  come  up  with  hierarchical  Hayes  (HU)  estimators  of  p , . 
Note  that  in  a  classical  RH  approach,  the  lower  stage  Bayesian  analysis  is  performed 
as  if  the  hyperparameters  were  known  a  priori.  This  approach  ignores  the  error  as¬ 
sociated  with  the  estimation  of  the  hyperparameters.  On  the  other  hand,  the  HH 
approach  models  the  uncertainty  of  the  hyperparameters  by  the  second  stage  prior 
Accordingly,  unlike  positive  part  RH  estimators,  the  IIB  estimators  are  smooth,  and 
bear  the  potentiality  of  being  admissible. 

To  introduce  the  IlH  model,  first  note  that  as  in  Section  2,  one  may  start  with 
the  minimal  sudicient  statistic  (X|,X2,  tr(Rt_,S|  f  V2  1 52 ) ) .  Write  r  1  -  a2  and 
(pr )  1  —  r2  i.e.  p  —  cr2/r2.  Now  conditional  on  P\,p2  and  r,  X,,X2  and  U  — 
tr(\/,  '.S't  I  V2  '  S2)  are  mutually  independent  with  X,  —  /Vp  (// , ,  (n  ,r) '  1 V, ),  X2  -- 
^p(/,2’(” 2r)_'V2)  an<l  U  ~  r  1  \2n|4n2_2)f,.  Next  we  assume  that  conditional  on 
p  and  r,  //(  and  p.2  are  mutually  independent  with  p\  ~  N(u,  (rp) " 1  n\ 1  Rj )  ami 
Pi  ~  N(p,  n2  1  (pr)  '■  V2) .  Also,  it  is  assumed  that  u,  p  and  r  are  mutually  independent 
with  p  uniform  on  Rr ,  p  has  the  type  II  Beta  distribution  with  pdf  ht(p)  oc  pm  '(I  | 
p)  >n|  where  rn(>  0)  is  known,  while  r  has  a  gamma  distribution  with  pdf 

hi (r)  oc  exp(  -  Ifrrjr*  *,  o(>  0)  and  />(  >  0)  being  known.  We  shall  aim  at  finding  the 
posterior  distributin  of  //.  —  (fiJ.P2  )'  given  X|,X2  and  u. 

First  note  that  the  joint  prior  distribution  of  //|,//2,//,  r  and  p  is  given  by 

f(Pi,P2,P,  r.p)  oc  (pr)r 

•exp[  ^r{n,(//,  -  is)TVx  1  (/y ,  -  i/)  I- n2(/<2  v)T  V2  '  (p2  ~  p)} 

h\(p)hi(r)  (4.1) 

Next  observe  that 

”i(/'i  ^)rb j  '(/'i  '')  I  (P2  !')7  b2  1  (pi  /') 

=  \ii/  -  p-Vv.  ]{i'  -  /'•)) 
t  "| /'[F,  Vi  *  ”2P]V2'fi 2  plv.  'p.  (4.2) 

where  one  may  recall  that, //.  --  (r?  , Vj  1  I  -n2V2  1 )  ' 1  (n  t  T, ' i  I  »2V'2  1  / /  2 )  —  (R.  ’ )  ” 1  ( r7 ,  \ 
n 2 V *2  '//?)  with  R.  1  —  U|R,  1  I  n2R2~'.  Now  integrating  with  respect  to  u,  one  gets 
the  joint  pdf  of  pi,p2,  r  and  p  in  the  form  /(/'i  ,/»2,  »',/>) 


oc  (pr)r/2  exp|  Pr{nui]\'\  'p  i  I  n2p12\\'p2  p]V\  V}|M/»)Mr)  (4.3) 


The  exponent  in  (4.3)  is  easily  simplified  as 
V,  |  n2/t[K2  '/t2  -  nT.V~xn. 

=  /x[{n,K,  '  n.lY'K.n.r,'}/*,  4  n2V2'V.n2V^} 

-  /x[u,V,|_,l/.n2VrY,/t2  -  filniV2’iV.n,Vi~lni  (4.4) 


where  V.  =  (n|Vj  1  t  n2Vj  *)  '■  Also,  the  joint  pdf  of  X1(X2  and  U  conditional  on 
/X|,/x2  and  r  is  given  hy 

/(Xj,X2,u|/i,,//.2,r) 

oc  rr  exp|-r/2{nt(X|  -  /X| )TU,~ 1  (x,  -  ft i)  +  n2(x,  -  /z2)7V2" 1  (x,  -  /x2)}| 

•  exp(-ru/2)u(n,+nj' J)r/2",r(n' fn5_J)p/2  (4.5) 

Next  we  calculate 

G’  =  n,(/x,  -  x,)rVV '(/x,  -  x,)  4  n2(//2  -  x2)rV2  '(^2  -  x2) 

I  <-  -  /xJVy'/x.}  (4.6) 

which  is  needed  to  derive  the  posterior  distribution  of  //  given  X|,X2  and  u.  Using 
(4.4)  and  straightforward  algebra,  one  gets 

a  =  //[  T)}  i/x,  4  /x[D22/x2  -  2/x[D,2/x2  -  2n,x[VYVi  “  2n  2x[U2_l//2 

I  n.|x[tY  'xi  4  n2x[V2_lx2  (4.7) 

where 


-  n,V,-' 

4  n{n{V~' 

-  n.U.-'U.n.U,  '}, 

p22 

-  n2U2  ' 

4  />{n2lY'  - 

-  H2U2  ‘V.n2V2  '}, 

1)  12 

1  V\r?2V2~  1 

(4.8) 

We  now  write  G'  as  (V (  I  G2  where 


£*i  ~  |(/xi  -  AiiX|  -  A i2x2)r Dn (/x i  -  At|X|  A 1 2 x 2 ) 

♦  (/x 2  A2,X,  A22X2)t /422(/x2  A2IX,  -  A22X2) 

2(/X|  A||X|  A  i2x2)r  /4t2(/x2  A2|X|  A22X2)J 


(4.9) 


I 


<i2  ~  j rt , *X|  f  n2x[l,!  '^2  -  Mi  i t  A 1 2^2 ) 7  D\  i  M  i  i^i  t  ^12^2) 

“  (^21^1  I  ^22X2)T/^22(^2|X|  +  ^22^2) 

f-  2(/lnX1  f  A\iXi)t D\i{Ai\5i\  (  A22X2)]  ( 

From  (1-7),  (4.9)  and  (1.10),  it.  follows  that,  yt , , ,  Al2,  A2,  and  /I22  satisfy 


I)\iAi\  -  D\2  A7\  11 1 V ,  ' 


(4.10) 


1^22  ^  1 
»uA  1 
O22  A 1 


^(2^12  71 2^2 

/^|2^22 

i?T,Au 


(4.11) 


whirl)  can  ho  rewritten  as 


A|2  D}'  /,12^22*  ^21  —  l)72  l)]2A\\, 

(Du  l)n»n1)n)Au  -  n.F, 

[Dn  D]2l)u'  Dx2)A22  -  „2V2' 


(4.12) 


Tho  following  lomnia  whoso  proof  is  given  in  tho  Appendix  is  crucial  to  further 
simplify  <r2.  Recall  that  B  ~  cr2/(n2  I  r2)  —  p/(l  I  p)  and  W  -  (rijV,'1  I 
"2V2  ')  '("i''i  1  x ,  I  11 2V2  * X2)  ~  V.(n,V',  1  x ,  |  n2V2'x2). 


Lemma  4.1 

/1nXi  f  A12X2  ~  (I  Af)X|  t  B W  --  l>,(say) 

/1 2 1  x  1  t  ^22X2  ( 1  B ) x 2  f  B b2(say) 

From  (4.10),  (4.13)  and  (4.14),  tf'2  can  ho  simplified  as 

(.'2  -  n,xfl',  1 X )  I  tl2x]\r2  1 X  2 

((I  B)x]'  I  BWr}l)u{{\  B)xt  1  /tw| 
{(I  //)x[  1  /?Wr)/;22{(l  /7)Xj  1  /?W) 

I  2 { ( I  /?)*[  I  /lWr}f;,2{(l  B)x2  t  /fW} 

-  xflrr.l/  !  {(I  B)I 


(4-13) 


(4.14) 


BtuV,'V.)  Dn{{  I  B)l  l  Bn,V.\\  ’} 
(Bn,\\  'V.)I)22(Bv,\\ V',  ') 


t 


J  8 

f  2 { ( I  -  B)l  f  BniV-'V^D^BrttV.V-1) |x, 

I  X2r|n2Vy -{(I-/?)/ 
f-  Bn2V2'V.}I)u{(  1  -  fl)/  4  Bn2V.V2'} 

{Bn2V2lV.)Dn(Bn2V.V2l)  \  2{(l  -  B)I 
I-  Bn2V2xV.}D]2{Bn2V.V2x)  |x2 

x[|{(l  -  B)l  4-  BnxV;'V,)Dn{Bn2Vy2  ') 

4  {Bn,V;xV.)D22{\  -  B)I 

4  Bn2V,V2'}  +  2{(1  -  B)I  4-  BnxVx 'V.}Dn{(\  -  B)l 
I  Bn2V,V2'}\%2 
-  xT\(Bn2V2~'V.)Du{(l  -  B)l 
4  BniV.V-1}  4  {(1  -  B)I  4-  Bn2V2'V.}D22 

B(nyV. V-')  r2(Bn2V2-'V.)Di2(BnlV.V-')\xl  (4.15) 

From  (4.8),  one  gets 

B  ii  I  B22  -  2I)\2  ~  (I  -I- p)(ri  ( 1  f  ti2V2  ' )  p(n\ b|  I  r,2^2  )l-.(r?|V|  -f  fh^2  ) 
—  7i. | V*|  I  n2V2  '(since  T,  1  —  n.jK,  1  I  n.2V2  ') 

■=  C  (-*••«) 

Using  (4.8)  and  (4.16),  it  is  possible  to  simplify  G2  considerably.  This  is  done  in  the 
following  Lemma  whose  proof  appears  in  the  Appendix. 

Lonmin  4.2  G2  /4|xf{ii|T,  1  -n,  V'," 1  V.ny  V,  '  }x,  I  x\  {n2Vf  1  -  n2V2  lV,n2V2 }x2 
2x7  (rnV,  'V.n2V2  1  )x2| 

Therefore,  from  (4.9  ),  Lemma  4.1  and  Lemma  4.2,  G  can  be  written  as 

G  (/M  b|)7  D\ i (//- 1  -  l>i)  4  {ft2  -  b2)T D22(h2  -  b2)  -  2(//-i  b|)rDi2(/i2  -  b2) 

I  +  X,  t  Xt2D22  +  X2  2x[/;,2  +  X2)  (4.17) 

where 

Du*  =  n.T,  '  -n.VV’U.n,!/1 
B22*  v2V2  1  n2V2  'l  .  n2Vj  1 

/;12t  -  ny\\'V.n2V2'  (4.18) 


Returning  to  (4.3)  and  (4.5),  the  joint  pdf  of  X| , x2,  w, // ( ,  //.2,  r  and  p  is  given  by 


■  ■  r  »  ■! 


/  -■  «.  r. 


'  v-  <-  ^  •*.*  •/  %  V  ^ 


20 


Hence,  from  (-1.2!)  and  (4.23),  one  gets 


/(Xi,X2,u,r,p)  a 


/,r/2r(n.+"J -i)r/*(1  4  p)-r/2exp( +  BSS„  +  a)| 
H(«.+"J-l)p/*-l  .  I  p)  -(m+  I)  (4.24) 


Integrating  out,  with  respect,  to  r,  one  gets  the  joint,  pdf  of  X(,X2,f/  and  p  as 


/  \  r/2 

/(X| ,  X,,  u,  p)  a  I--"  J  !,p/ 2  '(U+/J.S5//  to)  ']r/2  f 

■  rm  '(]  I  p)  ,m">  (4.2 ft) 

Using  the  transformation  p/(\  I  p)  —  /?  provides  the  joint  pdf  of  X|,X2,f/  and  B  as 


/(x,.x2,»,R)  a  /?p/2+m  V"''"’  2,r/2~'(M : -i  BSS’u  t-cy)^''”3  0r/2  '  (4.20) 

Next  observe  from  (4.20)  and  (4.13)  that 

/7(/ti|/?.X|,X2,u,r)  —  f>,  =  (|  -  f?)x |  t  HW. 

Hence  the  III!  estimator  of  //t  is 

/?(//., |x, , x2, «)  -  x,  /o(/7|x, ,  x2.  »)(x,  W)  (4.27) 

Mill.,  from  (1.20),  one  gets 


/•;(tt|xl,x2.«)  - 


/„’  nr/2,m(u  f  t  cy) 


/?r/2im  I(u  |  |  ry)  ("I*"*  *dB 


(4.2«) 


Remark  4.1  From  simnltaneons  disgonalization  of  n,\j  1  and  ?i2l’2  1 ,  it,  is  easy  to 
show  from  (4. 18)  that 

f7|j+  —  L>22t  —  /7|2<  ~  (r»|l|  1  I  r,2k2  ')  'i  (4.20) 

so  that  from  (4.22)  one  gets 

-  (X,  x2)r (n j V,  1  I  n2V3  ')  '(x,  x2) 

which  is  precisely  the  numerator  of  F  defined  in  (3.1). 


(4.30) 


2) 


Remark  4.2  I),  is  sometimes  possible  to  rr’rl ur r*  (  hr  nbovn  !  1 1 i  estimator  to  an  HR 
estimator  of  the  form  X|  -  (rt>(F)  /  F)(x\  -  W).  Consider  for  example  the  situation 
when  r»  —  0  i.e.,  H  lias  the  improper  prior  /i2 (r)  —  r*  1 .  Now  writing  r  —  SSn/u, 
we  note  from  (4.20)  that  F  —  ( ( TI  i  t  »-2  2)p  t  2)v.  Also,  for  o  —  0,  it  follows  from 

( 1.28)  that 


F{n |x,,x2.w)  -  ['  nrl2'm( i  t  /?i.)  ‘"‘‘V  \w;  I'  n" 

)  X  ,,,  , 

I  fie/  \ 


J  ?n  1 


(1  I  fie) 


/«  ,  t'H  -  /</ 


o  VI 


fie  W  1 V(W 
I  I  fie  J  (I  i  fie)2 


^  ['  (  fiv  I 


I  t  fie 


!"i- m  1  edfi 


(I  -t  fie)2 


e -i/”/(,,")  »)'"'-f-V-  "  2,/„ 

n 


/■"/(«*».)  ,, 
'  L 


+  m  -  l 


(i  .,)  »■ 


r/e 


From  (4.21)  it  follows  tilt.  fi(fi  |  X|,X2,t/)  ran  he  expressed  as  <t>  '  (e)/e  —  <j>{r,’)jF. 
Next  note  that  integration  by  parts  gives  nr.merator  of  (4.21) 


>”1  (|  ,  2'r  Mmll 


1  n,’  2>r  ft 


\ 


rn 


>  i 


*  m  I 


(  (|  I  ”»}  nK 

..  |)  A, 

p  I  2 rn 

e{(u|  I  e i  2)p  *  2/i  2m  2} 

mn,  I/,  o  ,  , 

I  U  ~  (I  l/j  ^  (f  r/ 

n 

ll(’!i«  r  fmin  (•<..'?!)  nml 


(I  w) 


..j_  Hr 


4  <*  m  -  1 


du 


(4.22) 


C(fi|x,.x2,?/) 


p  t  2  m 

e{  (m|  t  n  j  2)/>  I  2/> 


2} 


(p  t  2m)((u,  t  n2  2)/>  t  2) 


f  ( ( u  i  I  u  2  2)/>  t  2^  2  m  2} 


4.22) 


so  that 


-M/0  ' 


(/>  1  2rM)((ri ,  j  n2  2 )/>  t  2) 


e2  2)/>  1  2<*>  2m  2 


2(p  2) 


**.»*  ’V.’V  "j*  "V*  V*  "V  ’»*  * V  "V  *«*  V  '/  \*»  ’ 


if  (/>  •  2ni)((ri,  \  r?2  2 )p  f  2)  <  2 (;>  --  2)((n,  \  n2  -  2 )p  -  2m  -  2)('.'  6  >  c)  *- — » 

;>{2»m(»?i  •  n2)  *  0}  -  p(j>  4 ) ( r? ,  f  n2  2)  -1m  f  8  which  holds  whenever  p  >  5 
and  m  ■  f(p  -1)(n  |  I  n  2  2)  f>}/2(nt  t  n2),  assuming  »|  1  n2  >  8.  Hence,  for  this 
choice  of  m.  </>(/’)  sat  isfies  condition  (ii)  of  Theorem  3.2  for  Q)  —  V,  —  V'2  =  /p.  Also, 
for  Q  l’|  --  V2  —  lv,  condition  (i)  of  Theorem  3.2  automatically  holds  when  p  ^  3. 

Finally,  noting  that  v  is  strictly  increasing  in  /*',  and  using  the  inequality 

f  V1  (  1  *  *')  ,,  (  n  :  I  *•  2  -  2)f*  .  - 

I  '  4  m  /  i  \  _ 2  fAm— 2  J 

I  ?/  ?  (  I  -  ?/  )  2 


f" . ’(  «  ).t 

-'ll  v  i  a ! 

V  I  Ilf"  '(I 


fm  '(I  H)"'ll,5  2'V'' 


(I  -  H) 


(d.3i) 


one  gels  after  clirect  differentiation  </>*(»•)  >  0.  Hence  ^‘(r)  is  |  in  v.  Hence,  condition 
(lii)  of  Theorem  3.3  also  holds.  Thereore,  when  a  —  0,  Q  =  I7)  =  V2  =  /,,,  p  >  5  and 
0  m  { ( p  t ) (»» i  I  v 2  2)  f)}/2(n,  I  n2),  the  11B  estimator  obtained  in  ('1-27) 


is  mmirnnx. 

Hon  lark  4.3  The  conclusion  given  in  Remark  3.2  bears  strong  resemblance  to  Straw- 
derman  (1971)  in  the  one  sample  problem.  However,  the  formulation  here  is  much 
more  general  than  the  one  given  in  Strawderman  (1971)  or  Strawderman  (1973). 
First.,  the  estimator  is  not,  shrunk  towards  zero  or  a  prespecified  point,  but  is  shrunk 
towards  the  pooled  mean.  In  Strawderman  (1971),  r  is  assued  to  be  known,  whereas 
in  Strawderman  (1973),  r  is  assumed  to  belong  t.o  (7,0c)  for  some  7  ^  0.  Our  for¬ 
mulation  is  also  more  general  than  the  one  given  in  Morris  (1983)  because  there  r  is 
assumed  known  and  (/>r)  1  is  given  a  uniform  prior  on  (0,oo). 


' r '/ .<V ■‘.C . ''i ■'./ .• t c  /  /  v  /,  c. yvy« y*  v  -r,  < 


A.  Proofs 


Proof  of  Lemma.  J.I:  To  prove  (4.13),  note  from  (4.8)  that 


(»27  -  Dl2Dn'Di2)Dn'Du  =  D22D;‘D m  -  Dtu 

~  n2V-'{  1  -1-  /,){/  -  l3V.rt2V2  1 }  •  '  •  ^  F,_ 1  ^  1 

P  n2  n, 

•>ii  F,~ 1  ( 1  f  p){I  -  TJF.n,  f,-'}  ~  pn2v2lv.,hvy' 

-  ((1  I  p)2/p)(l  -  Bv2V{'V.)\ /,-'(/  -  BV.ntVf  ')  -  pn2V2-'V.vy\\  ' 

-  ((II  P)2/P)(l  Bn2V2  1 V’. )(( 1  B)otV{1  f  o2VV')  /myy'F.n.F 

-  ((1  M)2//>){(1  -  0)n,K,-'  +  n2F2~l  -  77(1  -  B)n2V2 ‘F.n.F,-' 

-  /7n2 Fy  1 V, n2 Fy'}  -  pu2 Fy 1 F, » , Fy 1 
~  ((1  I  pY/pW  -  5)n,Fr'  4-  njVV*  "  -h  772u2Fy 1  F.n, Fy  1 } 

pn2V2  1  F,ri|  F,“ 1 

-  (u,Ff 1  -1-  n2V2')/U 

so  that 

7?n  vn{D22  -  /;,2)-'  =  /?f. 

and 

/422  n\2Du'nl2  r,  (n.vy'  t  n2v2')i)u'Dl2/n 

which  yields  by  symmetry 

D u  Vl2D2}D\2  ~  (n,F,  1  I  n2V2')l)2^D]2/B 

Hence  froiTi  (4.8)  one  gets 


DnD2}D]2)  ' 


/4(7;[2)  ’ /?„(",  F,  '  I  n2FyV 

i) ' 1  iy 1 ' 2  •  1  (i  i  p)o2iy'{/  -  /n.»2vy')i 

n,  n2  p 

l,-F.  '(I  PV.a2V2  ')F. 

H  I 

V,  F, 

B  n2F2  'F. 
n,  n, 


I 


o2V2 


s 


so  llial 


(/;.,  i)>2n2}f)]2)  'n.r,  ' 


1 


which  yields  from  (A. 2) 


'n.v',  1  i)Jtnu'i) „)  ' n , v,  ' 

=  /  /?l  -  n,r.  'V  .n.r,  1  nv.nx\\  1 

»! 

-  f  -  ny-{n2V2  '  I  r? ,  1  'jv-.n^v' 

”1 

-  (I-/?)/  (A.f>) 

(1.13)  now  follows  upon  writing  from  (4.12) 

A,,X|  I  yt,2x2  -  (/;„  -  Dy2D2}l)]2) -'n.VV'x,  t  Uxl' Dn(D22  D]2Dxl' Dl2)  ' n2V7 
-  \(Du  DnI)2}UTn)  'niVx'  -  l)u' I)n(I)22  -  l)J2Dn'DX2)  'n^V 
A  nu'D]2(D22  -  D}2Du'Du)-'(n2V2'x2  I  n.V.-'x,) 

and  invoking  (A. 2)  and  (A.f>).  The  proof  of  (4.14)  is  similar  and  hence  omitted. 

I'rnof  tif  lemma  ^.J?:Tlie  matrix  appearing  in  x||  |X)  in  (4.I5)  is  simplified  as 


u,V,  '  {(I  /?)/  f-  nnlVi  'V.}DU{(  I  -  W)/4-  /f«,V\VY '} 

(/Jn,V,-  'V,)D22{BnxV,V;')  I-  2{ ( 1  -  0)/  -I-  ffn1Vrr,V.}/;12(/?ii1l'.l'I-1) 
”.V,  1  (I  -  /f)2/;,,  -  H(1  n)(nx\\'VJ) „  f 

I  2  /  ?  ( 1  fll/Jnr/T,  1 


n,l'i  1 

(1  - 

/i)2{u.VV'(l  1 

1  P) 

priif7, 

1  k'.  ri ,  V', 

»(l 

1 V.  {n,V/,  1  ( 1 

1  P) 

p»;,r. 

T.n.F, 

mi 

/?){”. 1 

'(1 

i  p) 

1 4 

■r.r-.V' 

'}V*V 

\  '"i 

2II{\ 

H)n»  i 

r,  ’  v . 

n21'2  'V\ 

"  i  v, 

l)7v  |  V 

•,  'v>, 

vi  ' 

n,l',  1 

{'  (1 

i  ^)(  i 

n)2} 

n.f,  ' 

'Mi 

n)2 

2 /#(  1 

m(i 

i  p) 

n2} 

2pH{  1 

m». 

r,  'v. 

”  i  A,  T 

"1^1 

V  f  V  )•  . '  v.v.v  v  V V. v.v.v  v.  y.- v. v.  %-x-v-vv. 


I  2p B{  I  /i)» ,  IV '  ^ » 2 l'V '  ^  " i  IV  ‘ 

-  /?n,VV'  +n1K,-,V.nlK1-,{/i/(1  4  p)2 

-  2p/{\  I  p)2  p2/{  1  4-  p)2} 

-  l)\nlVl  1  -  »», V, ^T.n.lV'l 

Analogously,  the  matrix  appearing  in  x2  [-|x2  in  ( -1  - 1  r» )  mines  out  as 


(A.7) 


n2VV'  {(I  «)/  t  Bn2V2'V.}D27{(  1  -  /?)/  +  Dv2V.V2'} 

(Bv2V2 'V.)I)u{Bn2V.V2  ')  I  2{(l  «)/  I  fln2VV 1  F, } D]2(Bv2V.V2  1 ) 

-  W|r»2V2  '  -  v2V2'V.n2V2x\  (A.8) 

We  sliovv  that  the  matrix  appearing  in  xjf['|x2  in  (4.lr>)  is  null  i.e., 


{(I  B)I  I  Bn{V~'V,}I)u(Bn2V.V2  ') 

{Bn,Vt  'V.)l)22{{\  B)l  |-  m\n2V2  '} 

I  2  { ( I  U)J  \  fh,{V{  -  H)l  t  f1n2V.V2  '}  —  0 


(A.n) 


Left  hand  side  of  ( A .9) 


B{  I  B)I)uV.v2V2'  B{  I  B)nt\\  'V,D22  \  2(\ 
2B(l  B)(Di2V.n2V2'  4  ntVt~'V.Dl7) 

n7niv,  1  v, ( nt ,  f-  /;22  -  2/;12)F.n2ivI 

(using  (4.8)  and  (4.16)) 


II)2  I h 


m  I  /?){(<  I  p)nl\\  '  pn[\\'V.r,{V,'}V.r,2V2' 

//(I  B)»\V{  'l.{(l  1  1  -  P»2Y’2 ~'\\n2V2  '} 

2(1  li)2pn \Vx'V.»2V2  1  l  2/7(1  -  /?)pw,r(  T.iijl,  1 V'  n  2  V2“ 1 
2/4(1  /4)/m,l'  'r.n,^  '^."2^  '  /42»,  1',  1  ' 

(using  V,  1  —  1  I  v2\ \  ' ) 


n.V,  1 V.  n2  V  j  '|  2/4(1  -  /4)(l  I  p)  4  2p(  I  B)2  B2  \  pB(\  B)  4-2p/4(l  -  B)\ 
0,  sinre  B  —  pj ( I  4  p). 


Finally,  we  evaluate  the  matrix  appearing  in  x \  |-]X|  in  (4.15),  namely 


-(Bn2V2'V.)Du{(\  -  13)1  I  13V,nxV-x)  -  {(I  B)l 

I-  lhHV2'V.}I)72{in\n,Y\') 
4  2{Bn2V2'V\)Dn{BV^Vy 


n{  i  -  n)v2v2'v.i)u  -  n(\  B)i)22v.„yvx  1 

l32n2V2  1  V\(Wu  4  /;72  2/;„)l' ».,K,  1 


(using  (4.8)  and  (4.16)) 


---  -n(  l  -  B)n2V2lV.{(  I  I  /»)».,  V,  1  - 

ft(l  tf){(1  +/»)n,^  1  -  pi>2V2  1  V'.njl’j'1  l/,~ 1 
-  n2n2V2~'V.  U,!/,1 

(using  F  1  =  n,F,  1  {  n2l/2-') 


=  -2  Bn2V2'V,n\Vx' 


Combining  ( A.7)-(  A.  10),  the  proof  of  the  lemma  is  complete. 
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